Skip to content

[Revision] Add fast decode plan for flashinfer mla #4012

Merged
zhyncs merged 3 commits intosgl-project:mainfrom
Fridge003:deepseek
Mar 5, 2025
Merged

[Revision] Add fast decode plan for flashinfer mla #4012
zhyncs merged 3 commits intosgl-project:mainfrom
Fridge003:deepseek

Conversation

@Fridge003
Copy link
Copy Markdown
Collaborator

Motivation

Revision of #3987

@merrymercy @zhyncs @Ying1123

Modifications

Checklist

@Fridge003 Fridge003 force-pushed the deepseek branch 2 times, most recently from 413ffcc to d67566c Compare March 3, 2025 17:56
@zhyncs zhyncs mentioned this pull request Mar 3, 2025
12 tasks
@zhyncs
Copy link
Copy Markdown
Collaborator

zhyncs commented Mar 4, 2025

@merrymercy

@zhyncs zhyncs merged commit fc91d08 into sgl-project:main Mar 5, 2025
3 of 18 checks passed
@Fridge003 Fridge003 deleted the deepseek branch March 5, 2025 21:36
aoshen524 pushed a commit to aoshen524/sglang that referenced this pull request Mar 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants